Late Integration in Audio-visual Continuous Speech Recognition

Authors

Ashish Verma

Tanveer Faruquie

Chalapathy Neti

Sankar Basu

Abstract

Using visual information in speech recognition has been an area of interest because it can significantly improve speech recognition efficiency in conditions where audio-only recognition suffers because of a noisy environment. In this paper, we present a new approach to combining audio and video to improve the robustness of the speech recognition system in noisy environments. We also compare the results of the new approach with the corresponding results of approaches proposed earlier in the literature.

Similar resources

Continuous Audio-visual Speech Recognition

We address the problem of robust lip tracking, visual speech feature extraction, and sensor integration for audiovisual speech recognition applications. An appearance based model of the articulators, which represents linguistically important features, is learned from example images and is used to locate, track, and recover visual speech information. We tackle the problem of joint temporal model...

Asynchronous stream modeling for large vocabulary audio-visual speech recognition

This paper addresses the problem of audio-visual information fusion to provide highly robust speech recognition. We investigate methods that make different assumptions about asynchrony and conditional dependence across streams and propose a technique based on composite HMMs that can account for stream asynchrony and different levels of information integration. We show how these models can be tr...

An investigation of HMM classifier combination strategies for improved audio-visual speech recognition

The combining of independent audio and visual HMM classifiers (late integration) has been shown to outperform the combination of audio and visual features in a single HMM classifier (early integration) when either or both modalities are presented with distortion for the task of speech recognition. Theoretical foundations for the optimal combination of these audio and video classifiers are stil...

Continuous Audio-Visual Speech Recognition

We address the problem of robust lip tracking, visual speech feature extraction, and sensor integration for audio-visual speech recognition applications. An appearance based model of the articulators, which represents linguistically important features, is learned from example images and is used to locate, track, and recover visual speech information. We tackle the problem of joint temporal mode...

IDIAP Martigny - Valais - Suisse Continuous Audio-Visual Speech Recognition

Publication date: 1999
